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(54) rule: METHOD AND APPARATUS FOR PROCESSING CLINICAL TRIAL DATABASES 




(57) Abstract 

A method and apparatus for facilitating review of clinical data provides a poini-and-click, menu-drtvcn approach for reviewing, 
analyzing, and graphir\g clinical data (30, 32, 34, 36, 38 and 40). Real-time access to clinical information in electronic databases is 
provided so as to pcrmil clinicians to obtain information in a timely manner. A nstr is provided with the ability to browse pavient profiles 
(31). perfoTm data queries, reclassify numeric and charaacr variablcj, create and analyze variable subgroups (74), create basic summary 
tables (42) and graphic dau displays (58). The invention eliminates the time-consuming and problematic conversion step. The invention 
has the ability to export statistical databases to popular existing data formats, No progrBmming experience is required for using this system. 
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METHOD AND APPAJRATUS FOR PROCESSING CLINICAL TRIAL 

DATABASES 

5 

REFERENCE TO MATERIAL SUBJECT TO COPYRIGHT PROTECTION 
A portion of the disclosure of this patent document contains material which is 
subject to copyright protection. The copyright owner has no objection to the facsimile 
reproduction by anyone of the patent disclosure, as it appears in the Patent and 
10 Trademark Office patent files o: records, but oihenvise reserves all copyrights 
whatsoever. 

REFERENCE TO A MICROFICHE APPENDIX 
A niicrofiche appendix containing program code and a user's manual 
1 5 corresponding to the program code, comprising nine fiches and 580 frames, is submitted 
herevyith. 

BACKGROUND OF THE INVENTION 

20 L Field of the Invention 

The present invention relates to databases and, more specifically, to a method of 
processing clinical trial databases for users without a background in statistics. 

F ' H 

25 2. Description of the Prior Art . 

Generally, examination of information in clinical trial databases has been 
accomplished either by manual tabulation of information contained in written reports 
summarizing the database, or by creating project-specific or inquiry-specific computer 
30 programs to perform selected functions to extract desired information The manual 
cross-tabulation procedures are inadequate under normal circumstances, because the 
questions posed by the user are usually of such a specific nature that they are not likely 
to be answered by reorganization of information routinely contained in normal written 
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reports. At the same lime, where users are required to create project-specific computer 
programs, to extract information from these databases, the facility of the exploratory 
analysis may be hampered either by the user's lack of prograrmning sophistication and 
experience, or by the need of the user to involve at least one other person (i.e., a 

5 programmer or biostatistician) in the process of creating necessary programming code. 
Even then, certain additional tasks are required. These tasks typically include: (1) 
writing computer code specific to questions to be asked of the database; (2) creating 
unique fomnats for tables and figures, to contain information summarized from the 
database; (3) preparing additional computer programming code to facilitate the 

10 exportation of summarized information (i.e., statistical output, tables, figures) to word 
processing or other application's software, for preparation of reports and other 
summaries of the information. Generally, this process may take several days to develop 
the programming code and create the database inquiry. 

1 5 These methods have the disadvantages of inefficiency in the manual 

cross-tabulation process and loss of precision associated with the process of computer 
programming by a tlurd party. In particular, the original question of interest lo the user 
may not be adequately or completely answered by the computer programming code 
because of; (1) misunderstandings by the programmer of the user's desired information; 

20 (2) inability of the user to articulate to another individual exactly what they are 

interested in learning; (3) or incompatibilities between database software and word 
processing software. Furthermore, where users have an immediate need for information 
and data summarization, a process which requires several days for creation of computer 
code in order to answer questions of interest may significantly limit the user's ability to 

25 employ data and information efficiently and effectively in a clinical research 
decision-making. 

In addition, where information contained in a clinical trial database is sensitive, 
or subject to restrictions on access limited to only a few individuals within an 
30 organization, the need to involve computer programmers and biostatisticians, who may 
not be authorized to have access to the database, creates problems with confidentiality 
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of information which may compromise the effective conduct of business by the 
oreanization usinp th? d5.1AbEEe. 

Other computer-based systems have been developed to provide ihe facility for 
5 'real-time' interrogation of electronic databases. However, these systems are frequently 
designed by computer programmers and biostaiisticians, and are intended for use by this 
same population of individuals, rather than by clinical personnel without substantive 
statistical or programming background. Consequently, the computer program interfece 
between the database and the user may confuse or intimidate the user, who is neither a 
10 programmer nor a biostatistician. 

Thus, there exists a need for a method and apparatus to facilitate efficient, 
reliable, and rapid end-user exploration of clinical trial databases to answer specific 
questions related to the information contained in the database. There also exists a need 
15 for a process in which the end-user directly interfaces with the database, using software 
that offers the ability to cross-tabulate information contained in the database, perform 
summary descriptive statistical analyses, and generate associated tables and charts. 

SUMMARY OF THE INVENTION 
20 . 

The disadvantages of the prior art are overcome by the present invention, which 
in one aspect is a method for exploring, examining, and summarizing information 
contained in an electronic database. Furthermore, because the invention provides an 
interface between the non-statistician, non-programmer user and the database, the 
25 invention provides for real time examination of databases. The invention provides the 
user with the ability to create subgroups and subsets of the data, merge these identified 
subgroups, reclassify data contained in the database, and create summary reports, 
including tables and graphs, of the data. 



30 



The invention includes a window-driven application designed for clinical data 
review. This system is highly intuitive and user-friendly and provides a point-and-click 
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menu-driven approach for reviewing, analyzing, and graphing clinical data. Potential 
users include FDA ciinicai reviewers of computer aided new drug applications and 
clinical staff at pharmaceutical companies. The invention permits clinicians to obtain 
information in a timely manner. 



The invention is designed to accommodate dilBferent database structures, ll 
automatically recognizes character and numeric variables to create different inquiry 
statements, which are imponant for programmers and biostatisticians, but not for users. 
The invention shows formatted values and labels, instead of raw values, which have 
10 limited meaning to users. It allows users to change variable and data set names into 

more descriptive names and to label the data sets and variables. It validates the database 
structure to be used in different . modules to avoid user mistakes. It accommodates 
different types and names of key variables automatically for data merging once they are 
set up by an administrator. 

15 

An administrator controls the information to be reviewed, including study 

I 

protocol, drug name, and indication levels, establishes user identification, password, and 
working directories for each user; arranges data set and variable names to be used in the 
Adverse Event Module; and arranges a key variable for data merging and subgrouping. 
20 This setup ensures system security and integrity. 

The user can perform complicated inquiries without writing any code with the 
SAS* programming language. Traditionally, the user needed to know SAS 
programming, data format conventions, and database terminology in order to subset a 
25 data set in SAS. With the Subgrouping Module, the user can extract data with criteria 
he sets up online. The user can also create a subgroup with only the subject 
identification list and can use the data joining fimction later. 



30 



Using the SAS/GRAPH*' Module in SAS is considered time-consuming and 
cumbersome, even for the most experienced SAS programmers. The Graph Module of 
the invention provides a way to create simple but informative graphs, which allows the 
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user to graphically present data trends, to drill down for detail data listings or spot 
irforniaticn, and to export the graphs mio a loi of popular graphic formats such as .bmp, 
.gif, .pcx, etc. 

5 The Table Module in the invention can be used to generate three types of 

summary tables. It provides descriptive statistics such as means, standard deviations, and 
number of observations. The third style table module provides the option to count the 
number of patients or the number of observations in the output table. 

10 After an administrator sets up an adverse event file and variable names, the 

Adverse Event Module allows the user to view adverse events by unique patient count 
instead of observation count. It also separates treatments, body systems, and preferred 
terms, and provides percent information by treatment for comparison between different 
treatments. Features like these that usually require extensive data manipulation and table 

1 5 programming are simple when using the invention. 

The Reclassify Module provides a way of viewing the data from a different 
perspective. It translates data into a new grouping convention while maintaining format 
type. It automatically provides data range information such as maximum and minimum 
20 for numeric data and data values information for character data. 



Most of the modules mentioned above are linked to the primary data browsing 
and analysis modules to provide more flexibility and power. Also, data set and variable 

■l 

labels are displayed throughout the explorer to provide more information. 

25 

Thus, it is the object of this invention to provide a method and apparatus for real 
time examination and summarization of electronic databases. In particular, it is the 
object of this invention to provide a mechanism by which non-statisticians, non- 
programmer users can directly access information in electronic databases, without the 
30 requisite need to create computer programs for this purpose. Thus time wasted because 
of the necessity of involving a computer programmer to generate code for exploration of 
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15 



20 



25 



databases is avoided. Similarly, because the end user directly accesses the database, 
using an intuitive coiriputer iriterfact, the invcnticr, does not require significant training 
of end users in bio statistics or computer programming. 

These and other aspects of the invention will become apparent from the 
following description of the preferred embodiments taken in conjunction with the 
following drawings. As would be obvious to one skilled in the art, many variations and 
modifications of the invention may be effected without departing from the spirit and 
scope of the novel concepts of the disclosure. 

BRIEF DESCRIPTION OF THE FIGURES OF THE DRAWINGS 



FIG, 1 is a flow chart showing the organization of the user-accessible modules 
of one embodiment of the invention. 



FIG. 2 is a block diagram of a hardware configuration upon which a disclosed 
embodiment of the invention may run. 

DETAILED DESCRIPTION OF THE INVENTION 



A preferred embodiment of the invention is now described in detail. As used in 
the description herein and throughout the claims, the following terms take the meanings 
explicitly associated herein, unless the context dearly dictates otherwise: "a," "an," and 
"the" includes plural reference, "in" includes "in" and "on.** ' 

The invention may be embodied in a software program running in a digital 
computer. The complete source code for this embodiment is disclosed in the microfiche 
appendi>c, along with a user's guide that instructs the user how to operate all of the 
features of the embodiment. 



30 
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As shown in FIG. 1, the invention includes a series of . program modules 10, or 
subroutines, each of which performs specific functions. These include a user iogiu 20, 
and a select protocol module 22. The user is allowed to select between a data explorer 
module 24, a patient information module 26 and an adverse events charting module 28. 
5 . The data explorer module 24 allows the user to select between a report and analysis 
module 30, a browse data module 32, a find variable module 34, a view files module 36, 
a subgroup and join module 38, and a reclassify module 40, 

The report and analysis module 30 allows the user to select between a tables 
10 module 42, a listings module 50, a lab module 52, a statistics module 54, an INSIGHT 
module 56 and a graph module 58. The tables module 42 allows the user to select tables 
to be generated in one of three styles: a first style 44, as second style 46, and a third 
style 48. The graph module 58 allows the user to select between a mean module 60, a 
fi-equency module 62 and a plot module 64, 
15 . 

The view files module 36 allows the user to select between a view files & tables 
module 70 and a view WordPerfect* files module 72. The subgroup and join module 38 
allows the user to select between a create subgroup module 74, a join subgroup module 
76 and a join data module 78. The reclassify module 40 allows the user to select 
20 between a reclassify numeric variable module 80 and a reclassify character variable 
module 82. The functions associated with each of these modules is described below. 



The user login module 20 controls access to protocol data. Before a user can log 
into the system, an administrator must set up the user identification and password, and 
25 specify the authorized protocol data. 

The select protocol module 22 allows access lo authorized protocols Clinical 
research in the pharmaceutical industry is usually separated into drug, indication, and 
protocol levels, and the invention is designed to follow these conventions. An 
30 administrator must set up data access rights for each user for each specific drug, 
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indication, and protocol. Only authorized protocols can be accessed by the users. The 

J. u-... ^»U4 4a j»i„4» Jiii. ..... J. A- J - J L.. ^i. . 

UbCib uu uui iiavc iiic M^ui lu.uwi^iL. ui muuLiy any law ui aiitu^ais uaia piuviucu u^ UlC 

administrator, but the user can create any in-process data sets or export data into other 
formats for fiirther data manipulations. 

5 

The data explorer module 24 provides the interface to the various modules that 
allow the user to manipulate clinical data. Of these, the report and analysis module 30 
allows access to the primary clinical data display modules. 

10 Included under the report and analysis module 30 is the tables submodule 42, 

which is designed to create a variety of tables summarizing the data. Continuous 
variables can be summarized by a variety of statistics and can be grouped by class 
variables. The user can create 2-way cross-classification tables and can choose whether 
or not to include missing levels of a class variable in a table. Output from the tables 

15 submodule 42 can be customized by adding titles and/or footnotes. The user has the 
option to change the table font, and can choose to present the date and time of the 
output, the page number, and the page size (portrait or landscape). The tables can be 
saved and printed. There are three table styles to choose from labeled as first style 44, 
second style 46, and third style 48. 

20 

The first style table 44 is appropriate for summarizing continuous variables such 
as age, weight, and height. This style can be used to present up to 20 continuous 
variables whh as many of the following statistics as the user wishes to present: number 
of non-missing, number of missing, range, sum, mean, variance, maximum, minimum, 
25 standard deviation, standard error of the mean, coefficient of variation, student's t for 
testing the null hypothesis that the mean is zero and the corresponding p-vaiue; and 
corrected and uncorrected sums of squares. As an option the user can choose up to four 
grouping variables. This will create a separate table for each grouping variable 
combination. 
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The second style table 46 is appropriate for summarizing continuous variables 
such as age, weight, or height hy classification variables such ai race, sex, and treatment. 
This style can be used to present up to four continuous variables by two classification 
variables in a single table. A classification variable is required for second style and a 
5 single table is created, rather than a separate table for each combination of classification 
variables as in the first style table. 

The third style table 48 is appropriate for creating a 2-way cross-classification 
table such as treatment by race. The table gives the frequency and percent of each 
10 combination of the classification variables. Percents can be presented as overall, row, or 
column percents. The user can choose up to two column classification variables and up 
to two row classification variables to be presented in a single table. 

The graph submodule 58 is designed to create a variety of graphs summarizing 
15 the data. Output fi'om the graph submodule 58 can be customized by adding a title . 
and/or a footnote. Options such as title font are available in the menu bar, Available 
submodules included in the graph submodule 58 are mean 60, fi"equency 62, and plot 64. 

The mean submodule 60 is designed to create horizontal and vertical bar charts, 
20 and 3D horizontal and vertical bar charts for one continuous variable, grouped by a 
classification variable. Subgroupmg is also available. This submodule can be used to 
compare, for instance, the mean age between treatment groups, the mean age between 
genders, or the mean age between gender broken down by treatment groups. The output 
also includes the standard deviation of the response variable, as well as group frequency 
25 counts. 

The frequency submodule 62 is designed to create horizontal and vertical bar 
charts, 3D horizontal and vertical bar charts, pie charts, and 3D pie charts. The response 
and grouping variables must be classification variables, Subgrouping is also available. 
30 For data sets that include the key variable, the user can choose between patient counts 
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or event counts. For data sets that do not include key variable, only event counts can be 

used I ne USei dUl UUUSC lU pi&OL>«iL mu ^lapn ao m^u^jiiuj' wvfuilio wi uo p^»ww*»nj. *i*v 

frequency submodule 62 can be used, for instance, lo view the race distribution of 
subjects in a study, the gender distribution, or to view the race distribution among the 
5 treatment groups. One can compare the number of adverse events that occurred per 
treatment group, or the number of subjects v^ho experienced an adverse event per 
treatment group. 

The plot submodule 64 is designed to create line, scatter, or needle plots. The 
10 plot submodule 64 can be used, for example, to create a scatter plot of baseline versus 
final lab values to visually show a trend or to reveal outliers. The user can click on the 
outlier point to reveal such information as the patient number corresponding to the 
outlier, the treatment group that the patient is in, the patient's sex, age, and the x and y 
coordinates of the point. A.s another example the user can first create mean efficacy 
15 values by treatment and time using the statistics module 54. The plot submodule 64 can 
then be accessed from the statistics submodule 54 to create a line graph of efficacy 
values over time for each treatment group. 

The listings submodule 50 is used to create data listings in a desirable format and 
20 layout. The variables listed in the output are limited and sorted by user specified 

variables. The user has the option to subset the data before creating a list. Output from 
the listings submodule 50 can be customized by adding titles and/or footnotes, or 
choosing from other available options. 

25 The lab module 52 is used to view the laboratory data set, This module provides 

a gateway to explore the fiinctionality in LAB* optionally implemented in some SAS* 
products. 



30 



The statistics submodule 54 is used to produce statistics for continuous data 
such as age, height, and weight. Available statistics are number of non-missing, number 
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of missing, range, sum, mean, variance, minimum, maximum, standard deviation, 
standard error of the mean, coefEcient cf vsristion, skcwncss, oDu Miuosis. Grouping is 

, I » 

available in the statistics submodule 54. The graph submodule 58 can be accessed from 
the statistics submodule 54 so that the user can produce, for instance, a bar chart 

5 comparing mean age among treatment groups or mean change from baseline in efficacy 
variables among treatment groups. The user can also run the statistics submodule 54 to 
get mean efficacy values by. treatment and time and then access the graph submodule 58 
to create a line plot of mean efficacy values per treatment over time. Output from the 
statistics submodule 54 can be customized by adding titles and/or footnotes. The user 

10 can choose to present the date and time of the output, the page number, and the page 
size (portrait or landscape). 

The insight submodule 56 is used to view the data set under the INSIGHT* 
module optionally implemented in some SAS products. 

15 

The browse data submodule 32 in the invention is designed to perform such 
functions as viewing, searching, sorting, saving , and printing data sets, and exporting 
data sets to an external file. The user can create new data sets from existing data sets 
with such options as select variables to keep, select variables to drop, and rename 
20 variable. The delete function allows the user to delete data sets he has created. Original 
data sets can not be deleted. 

The find variable submodule 34 is used to identify the data sets that contain a 
particular variable, The find variable submodule 34 includes the ability to view data 
25 sets, to sort data sets, to save sorted data sets, to print data sets, and to export data sets 
to another file format. This module is linked to report & analysis submodule 30 and to 
the Browse Data Screen. The Browse Data Screen is similar to the browse data module 
32 but lacks the Delete function. 



30 
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The view files submodule 36 is used to view output files and tables created in the 
j«y*jrtf jrjTi^ \np\v Other ASCII fi!?s, 3.nd to vicvy WordPerfect files. Tv/o subrnodules 
are available in the view files submodule 36; view files & tables 70, and view 
WordPerfect* files 72. From the view files & tables submodule 70 the user may view 
5 and print files and tables created in the invention and can choose options such as view 
font and page size (portrait or landscape). From the view WordPerfect files submodule 
72 the user may invoke the software product WordPerfect. 

The subgroup & join submodule 38 is organized into three groups: create 
10 subgroup 74, join El subgroup 76, and join data 78. These modules are linked to the 

report & analysis submodule 30 and the patient information module 26 so that functions 
available in these modules can be performed directly from the subgroup 8l join 
submodule 38. The create subgroup module 74 allows the user to create a subset from • 
an existing data set and save the subset for later usage. For example, the user can create 
15 data sets containing all patients with adverse event equal to *headache\ containing only 
males, containing only those patients between 30 and 45 years of age, or a data set 
containing only males between the ages of 30 and 45. The join subgroup submodule 76 
is designed to join a subgroup created using the create subgroup module 74 with 
another data set with the same key variable. This allows the user to create a data set 
20 including only the subjects which were identified in the create subgroup module 74 The 
join data module 78 allows the user to join two data sets together, by a key variable. 
The user can select all variables or selected variables to be kept in the new joined data 
set. This module also validates the data set that user selected. 

I 

25 The reclassify submodule 40 is used to create new variables from existing 

variables. The data set containing the new variables can be saved for later usage. The 
reclassify submodule 40 is linked to the report & analysis submodule 30 and the patient 
information module 26 so that functions available in these modules can be directly 
accessed fi-om the reclassify submodule 40. Two submodules are available under the 

30 reclassify submodule 40: the reclassify numeric variable submodule 80 and the reclassify 
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character variable submodule 82. The reclassify numeric variable module 80 allows the 
user to reclassify numeric variables. As an example, the user can create a nsvv 
classification variable 'newage' by grouping the numeric variable 'age' into levels such 
as <30, 30-45, and >45. The reclassify character variable module 82 allows the user to 
5 reclassify the variable according to the user's own grouping criteria. The user can select 
a data set and variables from the screen. All the existing values of the variable will be 
displayed for the user. The user can group different values of the variable to create a 
new variable. 

10 The patient information module 26 allows the reviewer to browse the patient 

profiles. The Patient Information module also allows the user to view a subset of any 
data in the protocol, grouped by the subject identifier. It can also be invoked from other 
modules to view only the patient information in the current file that the user wishes to - 
work with. 

15 

The adverse event module 28 allows the user to determine the fi-equency and 
percentage of patients with adverse events, grouped according to treatment, body 
system, and preferred term. It gels the information from an adverse event data set which 
was set up by the administrator, and displays the fi-equency and percentage values in an 
20 organizational chart format. The chart has three levels: treatment, body system, and 
preferred term. The module also provides the function to "drill down' to a subset data 
set or to individual patient information. 

In order to run the embodiment disclosed in the microfiche appendix, a stand 
25 alone or networked personal computer (PC) running Windows^ 3.1 or Windows™ for 
Workgroups 3.11 and MS-DOS* 6.x, or higher, would be sufficient, However, 
Windows 95™ or Windows NT™ platforms are recommended for optimum 
performance. 
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The minimum hardware requirements for the embodiment 100 disclosed in the 
appendix are: inter 486DX2 66 MHZ CPU 102, 256 KB cache, 32 MB RAM 104, a 
super VGA 15" 800X600 color moriitor 106, and a Hewlett Packard (HP*' Laser printer 
108 or InkJet printer. Minimum free hard disk space required depends on the size of the 
5 database and the location of the SAS software. A minimum of 100 to 150 MB of free 
hard disk space are recommended. As the possibility oif running multiple applications 
increases, the recommended minimum free hard drive space will increase 
commensurately too. One representative hardware configuration that works well with 
the above-disclosed embodiment includes an Intel* Pentium 90 MHZ CPU, a 256 KB 
10 cache, 64 MB RAM, 1 50 MB free hard drive space, and a super VGA 1 T color 
monitor. 



As is obvious to those skilled in the art of computer software design, the above- 
disclosed invention could be readily adapted to operate on other computer platforms. 
15 including Unix* and Macintosh* platforms. 

One embodiment of the system was developed under SAS* System 6. 1 1, and 
utilizes newly developed features like object oriented programming, data table object 
functions, and 'drag and drop' on-screen editing. The SAS* modules required for this 
20 application are SAS/BASE*, SAS/CORE*, SAS/AF*, SAS/FSP*, SAS/ ACCESS*, 
S AS/ST AT*, and S AS/GRAPH®. The invention provides the gateways to access 
SAS/TNSIGHT* and SAS/LAB® modules. Competing programs that use other 
programming languages require conversion of SAS* data, which can cause errors. The 
invention eliminates this time-consuming and problematic step. 



25 



The SAS* data sets are defined for the most effective and efficient use. 
Although users can do extensive manipulation on the data sets, analysis data sets with 
the appropriate structure and sufficient information are provided. This prevents the user 
from spending unnecessary time manipulating data instead of reviewing data. 
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There are two basic data set layouts, which are referred to herein as ^Vertical" 
anH "horizontal." in a venica! layout, a subject can have mure than one associated 
record, whereas in a horizontal layout, each subject has only one record. 

5 The following is an example of a vertical layout: 
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20 and the following is an example of the horizontal layout: 
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Both of these data sets contain the same information, but each subject in the second 
table contains only one record, whereas several records may be assigned to each subject 
in the first table. 
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Some data displays, such as graphs, require data in^the vertical layout, while 
others will require the horizontal layout. When both layouts are provided, then the user 
can select the one that is required and quickly create the table or graph of interest. 

5 Every data set contains variables that are frequently needed to create tables, 

graphs or lists. Demographic variables, such as sex, race and age, as well as the 
treatment code, are included in every data set. This saves the user time because he or 
she will not need to merge data sets together to gel the needed variables into one data 
set. Likewise, e£Ficacy data sets and lab data sets contain the change from baseline value 

10 at each time point. 

The type of patient sample used to create graphs or tables can vary. For 
example, safety tables, such as adverse events, labs and vital signs may be created from 
the intent-to-treat safety sample! Efficacy tables may be created from the intent-to-treat 
15 efficacy sample. Indicator variables are included in every data set and can be used to 
extract the appropriate patient sample for the graph or table that the user wants to 
create. 

The above described embodiments are given as illustrative examples only. It will 
20 be readily appreciated that many deviations may be made from the specific embodiments 
disclosed in this specification without departing from the invention. Accordingly, the 
scope of the invention is to be determined by the claims below rather than being limited 
to the specifically described embodiments above. 
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CLAIMS 



What is claimed is: 



5 1. A method for processing clinical trial databases, comprising the steps of: 

a. assembling data from at least one clinical trial database into a data file 
hawng a preselected format; 

b. generating from the data file a table in a selected one of a first style, a 
second style or a third style; and 

10 c. generating a graph representing a selected subset of the data file. 

2. The method of Claim 1, fiirther comprising the steps of 

a. creating a subset from an existing data set; and 

b. joining a subgroup created with another data set with having a same key 
15 variable. 



3 . An apparatus for processing clinical trial databases, comprising: 

a. means for assembling data from at least one clinical trial database into a 
data file having a preselected format; 
20 b. means for generating from the data file a table in a selected one of a first 

style, a second style or a third style; and 
c. means for generating a graph representing a selected subset of the data 
file. 



25 4. The apparatus of Claim 3, fiirther comprising: 

a. means for creating a subset firom an e?dsting data set; and, 

b. means for joining a subgroup created with another data set with having i 
same key variable. 
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